Digit Recognition Using the SPEECHDAT Corpus

نویسندگان

  • Frederico Rodrigues
  • Isabel Trancoso
چکیده

With the remarkable evolution of telecommunications as we reach the end of this century, it becomes clear that speech recognition via the telephone network will play an increasingly important role, mainly due to the widespread use of both cellular and non-cellular telephones. For many applications of speech recognition over the telephone, digit recognition is fundamental. This paper describes a set of digit recognition experiments with the SPEECHDAT corpus for European Portuguese. We present techniques and results obtained with isolated and connected digits with both known and unknown length grammars. Error rates of 0.6% and 1,9% were achieved, respectively, for isolated digit and connected digit strings.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Quantile based histogram equalization for online applications

The noise robustness of automatic speech recognition systems can be increased by transforming the signal to make the cumulative density functions of the signal’s values in recognition match the ones that where estimated on the training data. This paper describes a real–time online algorithm to approximate the cumulative density functions, after Mel scaled filtering, using a small number of quan...

متن کامل

Development of the estonian speechdat-like database

A new database project has been launched in Estonia last year. It aims the collection of telephone speech from a large number of speakers for speech and speaker recognition purposes. Up to 2000 speakers are expected to participate in recordings. SpeechDat databases, especially Finnish SpeechDat, have been chosen as a prototype for the Estonian database. It means that principles of corpus design...

متن کامل

Development of a Real-time Asr System for Slovak Speechdat Database

This paper describes development of a real-time speech recognition system in Slovak for the voice-operated telephone services. The system is based on SPHINX2 platform. The decoder using Hidden Markov Models was trained on the SpeechDat-E Slovak database. It is speaker independent, large vocabulary, continuous speech real-time automatic speech recognition system. Test results are given for the t...

متن کامل

Monolingual and Bilingual Spanish-Catalan Speech Recognizers Developed from SpeechDat Databases

Under the SpeechDat specifications, the Spanish member of SpeechDat consortium has recorded a Catalan database that includes one thousand speakers. This communication describes some experimental work that has been carried out using both the Spanish and the Catalan speech material. A speech recognition system has been trained for the Spanish language using a selection of the phonetically balance...

متن کامل

Some Experiments on the Use of One-channel Noise Reduction Techniques with the Italian Speechdat Car Database

In this work the use of noise reduction techniques for handsfree speech recognition in car environment is investigated. A set of experiments was carried out using different speech enhancement algorithms based on noise estimation. In particular, linear subtraction and MMSE estimators are considered in their various configurations, which depend on a different set of parameters. Experiments were c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001